Overview

Dataset statistics

Number of variables21
Number of observations3168
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory686.9 KiB
Average record size in memory222.0 B

Variable types

NUM20
CAT1

Reproduction

Analysis started2020-05-02 14:14:08.609344
Analysis finished2020-05-02 14:15:32.049407
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 2 (0.1%) duplicate rows Duplicates
median is highly correlated with meanfreq and 1 other fieldsHigh Correlation
meanfreq is highly correlated with median and 2 other fieldsHigh Correlation
Q25 is highly correlated with meanfreq and 1 other fieldsHigh Correlation
kurt is highly correlated with skewHigh Correlation
skew is highly correlated with kurtHigh Correlation
centroid is highly correlated with meanfreq and 2 other fieldsHigh Correlation
dfrange is highly correlated with maxdomHigh Correlation
maxdom is highly correlated with dfrangeHigh Correlation
mode has 236 (7.4%) zeros Zeros
dfrange has 65 (2.1%) zeros Zeros
modindx has 65 (2.1%) zeros Zeros

Variables

meanfreq
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1809066104
Minimum0.03936334258
Maximum0.2511237587
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.03936334258
5-th percentile0.1259677733
Q10.1636621363
median0.1848384094
Q30.1991460509
95-th percentile0.2291036805
Maximum0.2511237587
Range0.2117604161
Interquartile range (IQR)0.03548391458

Descriptive statistics

Standard deviation0.0299178379
Coefficient of variation (CV)0.1653772509
Kurtosis0.805160543
Mean0.1809066104
Median Absolute Deviation (MAD)0.02297305786
Skewness-0.617495272
Sum573.1121417
Variance0.0008950770245
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.03936334 0.07808134 0.11040551 0.13036129 0.1489268 ... 0.20155283 0.21265048 0.23706762 0.24340009 0.25112376], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.2121899149 2 0.1%
 
0.2137323717 2 0.1%
 
0.2289027479 1 < 0.1%
 
0.1003994171 1 < 0.1%
 
0.1601551799 1 < 0.1%
 
0.2337442148 1 < 0.1%
 
0.1909706078 1 < 0.1%
 
0.1566055098 1 < 0.1%
 
0.1328546309 1 < 0.1%
 
0.1777227378 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.03936334258 1 < 0.1%
 
0.04825407519 1 < 0.1%
 
0.05964548674 1 < 0.1%
 
0.05978098496 1 < 0.1%
 
0.06218231186 1 < 0.1%
 
ValueCountFrequency (%) 
0.2511237587 1 < 0.1%
 
0.2496365929 1 < 0.1%
 
0.2470406841 1 < 0.1%
 
0.2443564469 1 < 0.1%
 
0.2435280366 1 < 0.1%
 

sd
Real number (ℝ≥0)

Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.05712596491
Minimum0.01836324244
Maximum0.1152732467
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.01836324244
5-th percentile0.03161699632
Q10.04195354558
median0.05915511913
Q30.06702042292
95-th percentile0.08548695077
Maximum0.1152732467
Range0.0969100043
Interquartile range (IQR)0.02506687734

Descriptive statistics

Standard deviation0.01665224708
Coefficient of variation (CV)0.2915004956
Kurtosis-0.5217889483
Mean0.05712596491
Median Absolute Deviation (MAD)0.01333300849
Skewness0.1369163179
Sum180.9750568
Variance0.0002772973329
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01836324 0.0241353 0.02716882 0.03017243 0.03933148 ... 0.06394703 0.08026661 0.08959144 0.09604581 0.11527325], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.04319043089 2 0.1%
 
0.05770473882 2 0.1%
 
0.03732115187 1 < 0.1%
 
0.03714077727 1 < 0.1%
 
0.04305049121 1 < 0.1%
 
0.03329691933 1 < 0.1%
 
0.03230787169 1 < 0.1%
 
0.02456279029 1 < 0.1%
 
0.08024170239 1 < 0.1%
 
0.0374062371 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.01836324244 1 < 0.1%
 
0.02178199064 1 < 0.1%
 
0.02400166502 1 < 0.1%
 
0.02426893209 1 < 0.1%
 
0.02456279029 1 < 0.1%
 
ValueCountFrequency (%) 
0.1152732467 1 < 0.1%
 
0.1145080382 1 < 0.1%
 
0.1126491188 1 < 0.1%
 
0.1112604922 1 < 0.1%
 
0.1112569693 1 < 0.1%
 

median
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3077
Unique (%)97.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1856206765
Minimum0.01097457627
Maximum0.2612244898
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.01097457627
5-th percentile0.1163549305
Q10.1695925234
median0.1900323792
Q30.2106181268
95-th percentile0.2358259823
Maximum0.2612244898
Range0.2502499135
Interquartile range (IQR)0.04102560347

Descriptive statistics

Standard deviation0.03636014631
Coefficient of variation (CV)0.1958841386
Kurtosis1.629500928
Mean0.1856206765
Median Absolute Deviation (MAD)0.02746044974
Skewness-1.012784663
Sum588.0463031
Variance0.00132206024
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01097458 0.05818689 0.10229521 0.12372375 0.16179353 ... 0.20002092 0.22158891 0.23864256 0.24374139 0.26122449], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1866666667 6 0.2%
 
0.22 4 0.1%
 
0.1720315582 3 0.1%
 
0.1792 3 0.1%
 
0.1834482759 3 0.1%
 
0.2041325536 3 0.1%
 
0.2121212121 3 0.1%
 
0.1859854015 2 0.1%
 
0.1909204647 2 0.1%
 
0.1660319489 2 0.1%
 
Other values (3067) 3137 99.0%
 
ValueCountFrequency (%) 
0.01097457627 1 < 0.1%
 
0.01358752166 1 < 0.1%
 
0.01579030977 1 < 0.1%
 
0.02699468085 1 < 0.1%
 
0.02936129647 1 < 0.1%
 
ValueCountFrequency (%) 
0.2612244898 1 < 0.1%
 
0.2605405405 1 < 0.1%
 
0.2574170495 1 < 0.1%
 
0.2569835597 1 < 0.1%
 
0.2566312595 1 < 0.1%
 

Q25
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3103
Unique (%)97.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1404555905
Minimum0.0002287581699
Maximum0.2473469388
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.0002287581699
5-th percentile0.04358040895
Q10.1110865126
median0.1402864183
Q30.1759387716
95-th percentile0.2152441287
Maximum0.2473469388
Range0.2471181806
Interquartile range (IQR)0.06485225897

Descriptive statistics

Standard deviation0.04867971586
Coefficient of variation (CV)0.346584395
Kurtosis0.01833354583
Mean0.1404555905
Median Absolute Deviation (MAD)0.03838218151
Skewness-0.4908766849
Sum444.9633107
Variance0.002369714736
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.28758170e-04 3.17132418e-04 2.22154649e-02 8.62815137e-02 9.48661210e-02 ... 1.57797896e-01 1.84526081e-01 2.05832822e-01 2.29437186e-01 2.47346939e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.14 6 0.2%
 
0.1716129032 3 0.1%
 
0.2036363636 3 0.1%
 
0.1590669676 2 0.1%
 
0.1686902928 2 0.1%
 
0.1480888889 2 0.1%
 
0.2163636364 2 0.1%
 
0.1889570552 2 0.1%
 
0.103127572 2 0.1%
 
0.1613452915 2 0.1%
 
Other values (3093) 3142 99.2%
 
ValueCountFrequency (%) 
0.0002287581699 1 < 0.1%
 
0.0002354920101 1 < 0.1%
 
0.0002395209581 1 < 0.1%
 
0.0002502234138 1 < 0.1%
 
0.000266920877 2 0.1%
 
ValueCountFrequency (%) 
0.2473469388 1 < 0.1%
 
0.2421235324 1 < 0.1%
 
0.240735194 1 < 0.1%
 
0.2405416249 1 < 0.1%
 
0.2394594595 1 < 0.1%
 

Q75
Real number (ℝ≥0)

Distinct count3034
Unique (%)95.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2247649614
Minimum0.04294627383
Maximum0.2734693878
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.04294627383
5-th percentile0.1874137339
Q10.2087466155
median0.2256842149
Q30.2436604825
95-th percentile0.2576779694
Maximum0.2734693878
Range0.2305231139
Interquartile range (IQR)0.03491386695

Descriptive statistics

Standard deviation0.02363927828
Coefficient of variation (CV)0.1051733248
Kurtosis2.981810301
Mean0.2247649614
Median Absolute Deviation (MAD)0.0189401668
Skewness-0.9003108148
Sum712.0553978
Variance0.0005588154777
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.04294627 0.12795167 0.1654361 0.18059071 0.19479078 0.20625477 0.25893784 0.26676175 0.27346939], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.24 5 0.2%
 
0.2418181818 5 0.2%
 
0.245 4 0.1%
 
0.2290909091 4 0.1%
 
0.2488888889 4 0.1%
 
0.2333333333 4 0.1%
 
0.2413793103 4 0.1%
 
0.2047311828 3 0.1%
 
0.2549253731 3 0.1%
 
0.2477808219 3 0.1%
 
Other values (3024) 3129 98.8%
 
ValueCountFrequency (%) 
0.04294627383 1 < 0.1%
 
0.05826846704 1 < 0.1%
 
0.07595744681 1 < 0.1%
 
0.09019343987 1 < 0.1%
 
0.09266619014 1 < 0.1%
 
ValueCountFrequency (%) 
0.2734693878 1 < 0.1%
 
0.2698517298 1 < 0.1%
 
0.2689373297 1 < 0.1%
 
0.2689240506 1 < 0.1%
 
0.2687919943 1 < 0.1%
 

IQR
Real number (ℝ≥0)

Distinct count3073
Unique (%)97.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.08430937093
Minimum0.01455773126
Maximum0.2522252011
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.01455773126
5-th percentile0.0254866894
Q10.04255973444
median0.09427995392
Q30.1141750866
95-th percentile0.1563189556
Maximum0.2522252011
Range0.2376674698
Interquartile range (IQR)0.07161535212

Descriptive statistics

Standard deviation0.04278305438
Coefficient of variation (CV)0.5074531326
Kurtosis-0.448160298
Mean0.08430937093
Median Absolute Deviation (MAD)0.03712007129
Skewness0.2954323558
Sum267.0920871
Variance0.001830389742
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01455773 0.0204069 0.02450128 0.03949231 0.05240991 ... 0.09899928 0.11948 0.12756913 0.18159735 0.2522252 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.035 4 0.1%
 
0.04307692308 4 0.1%
 
0.105 4 0.1%
 
0.02333333333 3 0.1%
 
0.02488888889 3 0.1%
 
0.105844504 3 0.1%
 
0.04666666667 3 0.1%
 
0.102 2 0.1%
 
0.09739130435 2 0.1%
 
0.09435736677 2 0.1%
 
Other values (3063) 3138 99.1%
 
ValueCountFrequency (%) 
0.01455773126 1 < 0.1%
 
0.01492248062 1 < 0.1%
 
0.01511111111 1 < 0.1%
 
0.01549100968 1 < 0.1%
 
0.01658536585 1 < 0.1%
 
ValueCountFrequency (%) 
0.2522252011 1 < 0.1%
 
0.2487702574 1 < 0.1%
 
0.2481916538 1 < 0.1%
 
0.2459652707 1 < 0.1%
 
0.245300286 1 < 0.1%
 

skew
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.14016752
Minimum0.1417354241
Maximum34.72545327
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.1417354241
5-th percentile1.122956172
Q11.649568695
median2.197100657
Q32.931694047
95-th percentile6.918370952
Maximum34.72545327
Range34.58371784
Interquartile range (IQR)1.282125352

Descriptive statistics

Standard deviation4.240528713
Coefficient of variation (CV)1.350414806
Kurtosis25.36344634
Mean3.14016752
Median Absolute Deviation (MAD)1.837596959
Skewness4.933314185
Sum9948.050704
Variance17.98208377
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.14173542 0.69080647 0.961686 1.08766507 1.31926702 ... 3.57921917 4.15108919 5.68273859 7.98144987 34.72545327], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1.862572809 2 0.1%
 
2.113598278 2 0.1%
 
2.163847944 1 < 0.1%
 
1.522354093 1 < 0.1%
 
2.052550657 1 < 0.1%
 
2.559214753 1 < 0.1%
 
1.430503585 1 < 0.1%
 
2.21228525 1 < 0.1%
 
3.030983794 1 < 0.1%
 
1.408092868 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.1417354241 1 < 0.1%
 
0.2850202853 1 < 0.1%
 
0.3260330336 1 < 0.1%
 
0.529583803 1 < 0.1%
 
0.5487427053 1 < 0.1%
 
ValueCountFrequency (%) 
34.72545327 1 < 0.1%
 
34.53748756 1 < 0.1%
 
33.56633753 1 < 0.1%
 
33.16730036 1 < 0.1%
 
32.35073927 1 < 0.1%
 

kurt
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.56846079
Minimum2.068455491
Maximum1309.612887
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum2.068455491
5-th percentile3.755363316
Q15.669546856
median8.318463289
Q313.64890532
95-th percentile75.16913085
Maximum1309.612887
Range1307.544432
Interquartile range (IQR)7.979358464

Descriptive statistics

Standard deviation134.9286612
Coefficient of variation (CV)3.689755005
Kurtosis35.93212929
Mean36.56846079
Median Absolute Deviation (MAD)49.68604261
Skewness5.872586435
Sum115848.8838
Variance18205.74362
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2.06845549 2.58779613 2.99506375 3.94741564 6.9236621 ... 45.91458277 85.80720788 162.97692038 1035.22810321 1309.61288737], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6.109790286 2 0.1%
 
7.890926985 2 0.1%
 
3.820472506 1 < 0.1%
 
7.406555457 1 < 0.1%
 
11.5090583 1 < 0.1%
 
45.89524782 1 < 0.1%
 
810.4492034 1 < 0.1%
 
21.96557969 1 < 0.1%
 
9.11888912 1 < 0.1%
 
15.23222875 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
2.068455491 1 < 0.1%
 
2.209672772 1 < 0.1%
 
2.269432223 1 < 0.1%
 
2.293368 1 < 0.1%
 
2.46256132 1 < 0.1%
 
ValueCountFrequency (%) 
1309.612887 1 < 0.1%
 
1271.353628 1 < 0.1%
 
1202.684552 1 < 0.1%
 
1193.434066 1 < 0.1%
 
1128.534782 1 < 0.1%
 

sp.ent
Real number (ℝ≥0)

Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8951270643
Minimum0.7386506862
Maximum0.981996589
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.7386506862
5-th percentile0.8167506621
Q10.8618109811
median0.9017668303
Q30.9287134567
95-th percentile0.962986275
Maximum0.981996589
Range0.2433459027
Interquartile range (IQR)0.06690247559

Descriptive statistics

Standard deviation0.0449795184
Coefficient of variation (CV)0.05024931117
Kurtosis-0.423924736
Mean0.8951270643
Median Absolute Deviation (MAD)0.03718758282
Skewness-0.4309339825
Sum2835.76254
Variance0.002023157075
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.73865069 0.7711637 0.79946337 0.81671027 0.84265383 ... 0.92799248 0.94364586 0.96924199 0.97649779 0.98199659], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.8776688689 2 0.1%
 
0.8597123484 2 0.1%
 
0.9521931179 1 < 0.1%
 
0.8368828552 1 < 0.1%
 
0.9642917208 1 < 0.1%
 
0.8499423737 1 < 0.1%
 
0.891770807 1 < 0.1%
 
0.9120867575 1 < 0.1%
 
0.8777774775 1 < 0.1%
 
0.9130407629 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.7386506862 1 < 0.1%
 
0.7475694665 1 < 0.1%
 
0.7476948697 1 < 0.1%
 
0.748495008 1 < 0.1%
 
0.7486763795 1 < 0.1%
 
ValueCountFrequency (%) 
0.981996589 1 < 0.1%
 
0.9784817901 1 < 0.1%
 
0.9765329702 1 < 0.1%
 
0.9764626195 1 < 0.1%
 
0.976355461 1 < 0.1%
 

sfm
Real number (ℝ≥0)

Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4082164114
Minimum0.03687647451
Maximum0.8429359314
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.03687647451
5-th percentile0.1584453164
Q10.2580405215
median0.3963351568
Q30.5336761646
95-th percentile0.7328248346
Maximum0.8429359314
Range0.8060594569
Interquartile range (IQR)0.2756356431

Descriptive statistics

Standard deviation0.177521105
Coefficient of variation (CV)0.4348700837
Kurtosis-0.8359339024
Mean0.4082164114
Median Absolute Deviation (MAD)0.1499074873
Skewness0.339957584
Sum1293.229591
Variance0.03151374273
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.03687647 0.08060046 0.11729943 0.16490454 0.32469214 0.3860432 0.54420601 0.61476266 0.78752155 0.84293593], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.3143976797 2 0.1%
 
0.08493436355 2 0.1%
 
0.5809385427 1 < 0.1%
 
0.2197099945 1 < 0.1%
 
0.7598781215 1 < 0.1%
 
0.4131398555 1 < 0.1%
 
0.4895337052 1 < 0.1%
 
0.3873481721 1 < 0.1%
 
0.6816032061 1 < 0.1%
 
0.3696063141 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.03687647451 1 < 0.1%
 
0.08023747084 1 < 0.1%
 
0.08096344338 1 < 0.1%
 
0.08220408541 1 < 0.1%
 
0.08265560635 1 < 0.1%
 
ValueCountFrequency (%) 
0.8429359314 1 < 0.1%
 
0.8313468673 1 < 0.1%
 
0.8260991385 1 < 0.1%
 
0.8226706545 1 < 0.1%
 
0.8225866361 1 < 0.1%
 

mode
Real number (ℝ≥0)

ZEROS
Distinct count2825
Unique (%)89.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1652817968
Minimum0
Maximum0.28
Zeros236
Zeros (%)7.4%
Memory size24.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.1180157757
median0.1865986395
Q30.2211041346
95-th percentile0.2608128743
Maximum0.28
Range0.28
Interquartile range (IQR)0.1030883589

Descriptive statistics

Standard deviation0.07720301386
Coefficient of variation (CV)0.4670993139
Kurtosis-0.2559077036
Mean0.1652817968
Median Absolute Deviation (MAD)0.06227791588
Skewness-0.8372359937
Sum523.6127321
Variance0.005960305348
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.00036395 0.0227428 0.0498901 0.05005411 ... 0.2006304 0.24562005 0.26160889 0.27985169 0.28 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 236 7.4%
 
0.28 10 0.3%
 
0.06004765687 5 0.2%
 
0.0600536193 5 0.2%
 
0.1866666667 5 0.2%
 
0.05006165228 4 0.1%
 
0.05008797654 4 0.1%
 
0.05005586592 4 0.1%
 
0.05005988024 4 0.1%
 
0.05011424219 4 0.1%
 
Other values (2815) 2887 91.1%
 
ValueCountFrequency (%) 
0 236 7.4%
 
0.0007279029463 1 < 0.1%
 
0.0007749077491 1 < 0.1%
 
0.0008007626311 1 < 0.1%
 
0.0008427389014 1 < 0.1%
 
ValueCountFrequency (%) 
0.28 10 0.3%
 
0.2797033898 1 < 0.1%
 
0.2795851852 1 < 0.1%
 
0.2795229983 1 < 0.1%
 
0.2791181102 1 < 0.1%
 

centroid
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1809066104
Minimum0.03936334258
Maximum0.2511237587
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.03936334258
5-th percentile0.1259677733
Q10.1636621363
median0.1848384094
Q30.1991460509
95-th percentile0.2291036805
Maximum0.2511237587
Range0.2117604161
Interquartile range (IQR)0.03548391458

Descriptive statistics

Standard deviation0.0299178379
Coefficient of variation (CV)0.1653772509
Kurtosis0.805160543
Mean0.1809066104
Median Absolute Deviation (MAD)0.02297305786
Skewness-0.617495272
Sum573.1121417
Variance0.0008950770245
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.03936334 0.07808134 0.11040551 0.13036129 0.1489268 ... 0.20155283 0.21265048 0.23706762 0.24340009 0.25112376], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.2121899149 2 0.1%
 
0.2137323717 2 0.1%
 
0.2289027479 1 < 0.1%
 
0.1003994171 1 < 0.1%
 
0.1601551799 1 < 0.1%
 
0.2337442148 1 < 0.1%
 
0.1909706078 1 < 0.1%
 
0.1566055098 1 < 0.1%
 
0.1328546309 1 < 0.1%
 
0.1777227378 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.03936334258 1 < 0.1%
 
0.04825407519 1 < 0.1%
 
0.05964548674 1 < 0.1%
 
0.05978098496 1 < 0.1%
 
0.06218231186 1 < 0.1%
 
ValueCountFrequency (%) 
0.2511237587 1 < 0.1%
 
0.2496365929 1 < 0.1%
 
0.2470406841 1 < 0.1%
 
0.2443564469 1 < 0.1%
 
0.2435280366 1 < 0.1%
 

meanfun
Real number (ℝ≥0)

Distinct count3166
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1428067343
Minimum0.05556534931
Maximum0.2376363873
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.05556534931
5-th percentile0.09362709442
Q10.1169984856
median0.140518518
Q30.1695806199
95-th percentile0.1934318975
Maximum0.2376363873
Range0.182071038
Interquartile range (IQR)0.05258213425

Descriptive statistics

Standard deviation0.03230443258
Coefficient of variation (CV)0.2262108488
Kurtosis-0.8599596486
Mean0.1428067343
Median Absolute Deviation (MAD)0.02782210416
Skewness0.03914069149
Sum452.4117342
Variance0.001043576364
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.05556535 0.07132125 0.08973726 0.10158269 0.102153 ... 0.18294704 0.19388853 0.20162762 0.2169978 0.23763639], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1336673026 2 0.1%
 
0.1399424783 2 0.1%
 
0.1650750785 1 < 0.1%
 
0.1102603026 1 < 0.1%
 
0.1077247283 1 < 0.1%
 
0.1772383118 1 < 0.1%
 
0.1545285116 1 < 0.1%
 
0.1740093295 1 < 0.1%
 
0.1432953725 1 < 0.1%
 
0.1855433845 1 < 0.1%
 
Other values (3156) 3156 99.6%
 
ValueCountFrequency (%) 
0.05556534931 1 < 0.1%
 
0.05704523917 1 < 0.1%
 
0.06096570749 1 < 0.1%
 
0.06254164386 1 < 0.1%
 
0.06347517951 1 < 0.1%
 
ValueCountFrequency (%) 
0.2376363873 1 < 0.1%
 
0.2311352895 1 < 0.1%
 
0.22915253 1 < 0.1%
 
0.2257554714 1 < 0.1%
 
0.2234170486 1 < 0.1%
 

minfun
Real number (ℝ≥0)

Distinct count913
Unique (%)28.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.03680180834
Minimum0.009775171065
Maximum0.2040816327
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.009775171065
5-th percentile0.0157946693
Q10.01822323462
median0.04610951009
Q30.04790419162
95-th percentile0.0564414737
Maximum0.2040816327
Range0.1943064616
Interquartile range (IQR)0.02968095699

Descriptive statistics

Standard deviation0.01921995215
Coefficient of variation (CV)0.5222556449
Kurtosis10.75808575
Mean0.03680180834
Median Absolute Deviation (MAD)0.01554853254
Skewness1.878003958
Sum116.5881288
Variance0.0003694065605
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00977517 0.01563264 0.01635156 0.01685098 0.01796744 ... 0.05186387 0.05514074 0.07045356 0.10050505 0.20408163], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.04692082111 64 2.0%
 
0.04710500491 59 1.9%
 
0.04701273262 53 1.7%
 
0.0469667319 47 1.5%
 
0.04705882353 46 1.5%
 
0.04729064039 46 1.5%
 
0.04715127701 41 1.3%
 
0.04719764012 40 1.3%
 
0.0473840079 40 1.3%
 
0.04747774481 35 1.1%
 
Other values (903) 2697 85.1%
 
ValueCountFrequency (%) 
0.009775171065 1 < 0.1%
 
0.009784735812 1 < 0.1%
 
0.009900990099 1 < 0.1%
 
0.009910802775 1 < 0.1%
 
0.01016260163 1 < 0.1%
 
ValueCountFrequency (%) 
0.2040816327 1 < 0.1%
 
0.2 2 0.1%
 
0.1851851852 1 < 0.1%
 
0.1785714286 1 < 0.1%
 
0.1684210526 1 < 0.1%
 

maxfun
Real number (ℝ≥0)

Distinct count123
Unique (%)3.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2588422458
Minimum0.1030927835
Maximum0.2791139241
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.1030927835
5-th percentile0.1924698795
Q10.253968254
median0.2711864407
Q30.2774566474
95-th percentile0.2790697674
Maximum0.2791139241
Range0.1760211405
Interquartile range (IQR)0.02348839343

Descriptive statistics

Standard deviation0.03007730942
Coefficient of variation (CV)0.1161993837
Kurtosis5.203917878
Mean0.2588422458
Median Absolute Deviation (MAD)0.02149547188
Skewness-2.238534771
Sum820.0122346
Variance0.0009046445422
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.10309278 0.13734159 0.19139194 0.21164614 0.22792208 ... 0.27740757 0.27761721 0.27842377 0.27909185 0.27911392], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.2790697674 532 16.8%
 
0.275862069 462 14.6%
 
0.2774566474 339 10.7%
 
0.2711864407 235 7.4%
 
0.2666666667 159 5.0%
 
0.262295082 127 4.0%
 
0.2742857143 107 3.4%
 
0.25 101 3.2%
 
0.2580645161 86 2.7%
 
0.253968254 62 2.0%
 
Other values (113) 958 30.2%
 
ValueCountFrequency (%) 
0.1030927835 1 < 0.1%
 
0.1052631579 1 < 0.1%
 
0.1086956522 1 < 0.1%
 
0.1111111111 1 < 0.1%
 
0.1123595506 1 < 0.1%
 
ValueCountFrequency (%) 
0.2791139241 6 0.2%
 
0.2790697674 532 16.8%
 
0.2777777778 39 1.2%
 
0.2774566474 339 10.7%
 
0.2773584906 3 0.1%
 

meandom
Real number (ℝ≥0)

Distinct count2999
Unique (%)94.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8292109597
Minimum0.0078125
Maximum2.957682292
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.0078125
5-th percentile0.1044901207
Q10.4198279748
median0.765794837
Q31.177165904
95-th percentile1.800401651
Maximum2.957682292
Range2.949869792
Interquartile range (IQR)0.7573379297

Descriptive statistics

Standard deviation0.5252050333
Coefficient of variation (CV)0.6333792712
Kurtosis-0.05477253025
Mean0.8292109597
Median Absolute Deviation (MAD)0.428963389
Skewness0.6110224344
Sum2626.94032
Variance0.275840327
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0078125 0.00789561 0.06431236 0.14281385 0.93237744 1.38965343 1.62380642 2.00370593 2.28736111 2.95768229], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.0078125 61 1.9%
 
0.1953125 4 0.1%
 
0.71875 4 0.1%
 
0.0703125 4 0.1%
 
0.4609375 4 0.1%
 
0.6875 3 0.1%
 
1.372395833 3 0.1%
 
0.3678385417 3 0.1%
 
1.0078125 3 0.1%
 
0.2578125 3 0.1%
 
Other values (2989) 3076 97.1%
 
ValueCountFrequency (%) 
0.0078125 61 1.9%
 
0.007978723404 1 < 0.1%
 
0.007990056818 1 < 0.1%
 
0.00818452381 1 < 0.1%
 
0.008246527778 1 < 0.1%
 
ValueCountFrequency (%) 
2.957682292 1 < 0.1%
 
2.805245536 1 < 0.1%
 
2.676988636 1 < 0.1%
 
2.591579861 1 < 0.1%
 
2.544270833 1 < 0.1%
 

mindom
Real number (ℝ≥0)

Distinct count77
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.05264704541
Minimum0.0048828125
Maximum0.458984375
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.0048828125
5-th percentile0.0078125
Q10.0078125
median0.0234375
Q30.0703125
95-th percentile0.1875
Maximum0.458984375
Range0.4541015625
Interquartile range (IQR)0.0625

Descriptive statistics

Standard deviation0.06329947812
Coefficient of variation (CV)1.202336762
Kurtosis2.187585993
Mean0.05264704541
Median Absolute Deviation (MAD)0.04989571221
Skewness1.661113783
Sum166.7858398
Variance0.004006823931
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00488281 0.00634766 0.01123047 0.01513672 0.01757812 ... 0.21289062 0.23046875 0.23828125 0.26953125 0.45898438], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.0234375 1246 39.3%
 
0.0078125 814 25.7%
 
0.1640625 109 3.4%
 
0.0546875 63 2.0%
 
0.0048828125 61 1.9%
 
0.09375 46 1.5%
 
0.15625 46 1.5%
 
0.2109375 45 1.4%
 
0.0703125 42 1.3%
 
0.015625 38 1.2%
 
Other values (67) 658 20.8%
 
ValueCountFrequency (%) 
0.0048828125 61 1.9%
 
0.0078125 814 25.7%
 
0.0146484375 2 0.1%
 
0.015625 38 1.2%
 
0.01953125 1 < 0.1%
 
ValueCountFrequency (%) 
0.458984375 1 < 0.1%
 
0.44921875 1 < 0.1%
 
0.400390625 1 < 0.1%
 
0.3515625 1 < 0.1%
 
0.34375 1 < 0.1%
 

maxdom
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1054
Unique (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.047276738
Minimum0.0078125
Maximum21.8671875
Zeros0
Zeros (%)0.0%
Memory size24.9 KiB

Quantile statistics

Minimum0.0078125
5-th percentile0.3125
Q12.0703125
median4.9921875
Q37.0078125
95-th percentile10.640625
Maximum21.8671875
Range21.859375
Interquartile range (IQR)4.9375

Descriptive statistics

Standard deviation3.521156612
Coefficient of variation (CV)0.6976349415
Kurtosis1.31473759
Mean5.047276738
Median Absolute Deviation (MAD)2.791655732
Skewness0.7261889465
Sum15989.77271
Variance12.39854388
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[7.81250000e-03 1.17187500e-02 1.82617188e-01 6.36718750e-01 8.61816406e-01 ... 9.52734375e+00 1.02304688e+01 1.20000000e+01 1.20703125e+01 2.18671875e+01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.0078125 61 1.9%
 
7 21 0.7%
 
5.15625 16 0.5%
 
12.0234375 15 0.5%
 
0.7861328125 13 0.4%
 
0.5625 13 0.4%
 
0.6953125 13 0.4%
 
5.2734375 11 0.3%
 
0.734375 11 0.3%
 
5.390625 11 0.3%
 
Other values (1044) 2983 94.2%
 
ValueCountFrequency (%) 
0.0078125 61 1.9%
 
0.015625 3 0.1%
 
0.0234375 1 < 0.1%
 
0.0546875 1 < 0.1%
 
0.0703125 4 0.1%
 
ValueCountFrequency (%) 
21.8671875 1 < 0.1%
 
21.84375 1 < 0.1%
 
21.796875 1 < 0.1%
 
21.5625 1 < 0.1%
 
21.515625 1 < 0.1%
 

dfrange
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count1091
Unique (%)34.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.994629692
Minimum0
Maximum21.84375
Zeros65
Zeros (%)2.1%
Memory size24.9 KiB

Quantile statistics

Minimum0
5-th percentile0.265625
Q12.044921875
median4.9453125
Q36.9921875
95-th percentile10.60898437
Maximum21.84375
Range21.84375
Interquartile range (IQR)4.947265625

Descriptive statistics

Standard deviation3.52003912
Coefficient of variation (CV)0.7047647847
Kurtosis1.318012674
Mean4.994629692
Median Absolute Deviation (MAD)2.788782782
Skewness0.7282610635
Sum15822.98687
Variance12.39067541
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 3.90625000e-03 1.67968750e-01 5.05371094e-01 6.22558594e-01 ... 7.96875000e+00 9.45703125e+00 1.02890625e+01 1.20351562e+01 2.18437500e+01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 65 2.1%
 
5.1328125 15 0.5%
 
0.625 15 0.5%
 
0.6796875 14 0.4%
 
3.75 13 0.4%
 
0.6640625 12 0.4%
 
0.640625 11 0.3%
 
0.703125 11 0.3%
 
8.8828125 11 0.3%
 
6 10 0.3%
 
Other values (1081) 2991 94.4%
 
ValueCountFrequency (%) 
0 65 2.1%
 
0.0078125 3 0.1%
 
0.015625 1 < 0.1%
 
0.01953125 2 0.1%
 
0.0244140625 1 < 0.1%
 
ValueCountFrequency (%) 
21.84375 1 < 0.1%
 
21.8203125 1 < 0.1%
 
21.7734375 1 < 0.1%
 
21.5390625 1 < 0.1%
 
21.4921875 1 < 0.1%
 

modindx
Real number (ℝ≥0)

ZEROS
Distinct count3079
Unique (%)97.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1737515061
Minimum0
Maximum0.9323741007
Zeros65
Zeros (%)2.1%
Memory size24.9 KiB

Quantile statistics

Minimum0
5-th percentile0.05774969079
Q10.09976580594
median0.1393570226
Q30.2091832491
95-th percentile0.4055158151
Maximum0.9323741007
Range0.9323741007
Interquartile range (IQR)0.1094174432

Descriptive statistics

Standard deviation0.1194543894
Coefficient of variation (CV)0.6875013176
Kurtosis5.924935217
Mean0.1737515061
Median Absolute Deviation (MAD)0.08435563114
Skewness2.064334578
Sum550.4447715
Variance0.01426935115
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.00994068 0.03539158 0.04622135 0.06924954 ... 0.29504931 0.3567186 0.41657246 0.63456353 0.9323741 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 65 2.1%
 
0.2 3 0.1%
 
0.05263157895 3 0.1%
 
0.1333333333 3 0.1%
 
0.1666666667 3 0.1%
 
0.1176470588 3 0.1%
 
0.1462053571 2 0.1%
 
0.1923076923 2 0.1%
 
0.1722488038 2 0.1%
 
0.05 2 0.1%
 
Other values (3069) 3080 97.2%
 
ValueCountFrequency (%) 
0 65 2.1%
 
0.01988135321 1 < 0.1%
 
0.02164750958 1 < 0.1%
 
0.02194357367 1 < 0.1%
 
0.02216748768 1 < 0.1%
 
ValueCountFrequency (%) 
0.9323741007 1 < 0.1%
 
0.8795031056 1 < 0.1%
 
0.8577642453 1 < 0.1%
 
0.8547008547 1 < 0.1%
 
0.8444827586 1 < 0.1%
 

label
Categorical

UNIFORM
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size24.9 KiB
female
1584
male
1584
ValueCountFrequency (%) 
female 1584 50.0%
 
male 1584 50.0%
 

Length

Max length6
Mean length5
Min length4
ValueCountFrequency (%) 
Lowercase_Letter 5 100.0%
 
ValueCountFrequency (%) 
Latin 5 100.0%
 
ValueCountFrequency (%) 
ASCII 5 100.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

meanfreqsdmedianQ25Q75IQRskewkurtsp.entsfmmodecentroidmeanfunminfunmaxfunmeandommindommaxdomdfrangemodindxlabel
00.0597810.0642410.0320270.0150710.0901930.07512212.863462274.4029060.8933690.4919180.0000000.0597810.0842790.0157020.2758620.0078120.0078120.0078120.0000000.000000male
10.0660090.0673100.0402290.0194140.0926660.07325222.423285634.6138550.8921930.5137240.0000000.0660090.1079370.0158260.2500000.0090140.0078120.0546880.0468750.052632male
20.0773160.0838290.0367180.0087010.1319080.12320730.7571551024.9277050.8463890.4789050.0000000.0773160.0987060.0156560.2711860.0079900.0078120.0156250.0078120.046512male
30.1512280.0721110.1580110.0965820.2079550.1113741.2328314.1772960.9633220.7272320.0838780.1512280.0889650.0177980.2500000.2014970.0078120.5625000.5546880.247119male
40.1351200.0791460.1246560.0787200.2060450.1273251.1011744.3337130.9719550.7835680.1042610.1351200.1063980.0169310.2666670.7128120.0078125.4843755.4765620.208274male
50.1327860.0795570.1190900.0679580.2095920.1416341.9325628.3088950.9631810.7383070.1125550.1327860.1101320.0171120.2539680.2982220.0078122.7265622.7187500.125160male
60.1507620.0744630.1601060.0928990.2057180.1128191.5306435.9874980.9675730.7626380.0861970.1507620.1059450.0262300.2666670.4796200.0078125.3125005.3046880.123992male
70.1605140.0767670.1443370.1105320.2319620.1214301.3971564.7666110.9592550.7198580.1283240.1605140.0930520.0177580.1441440.3013390.0078120.5390620.5312500.283937male
80.1422390.0780180.1385870.0882060.2085870.1203811.0997464.0702840.9707230.7709920.2191030.1422390.0967290.0179570.2500000.3364760.0078122.1640622.1562500.148272male
90.1343290.0803500.1214510.0755800.2019570.1263771.1903684.7873100.9752460.8045050.0116990.1343290.1058810.0193000.2622950.3403650.0156254.6953124.6796880.089920male

Last rows

meanfreqsdmedianQ25Q75IQRskewkurtsp.entsfmmodecentroidmeanfunminfunmaxfunmeandommindommaxdomdfrangemodindxlabel
31580.1836670.0406070.1825340.1564800.2076460.0511662.0541387.4830190.8981380.3139250.1770400.1836670.1492370.0186480.2622950.5503120.0078123.4218753.4140620.166503female
31590.1687940.0858420.1889800.0955580.2402290.1446711.4622485.0779560.9562010.7068610.1844420.1687940.1828630.0206990.2711860.9882810.0078125.8828125.8750000.268617female
31600.1517710.0891470.1859700.0581590.2301990.1720401.2277104.3043540.9620450.7445900.2305470.1517710.2016000.0234260.2666670.7667410.0078124.0078124.0000000.192220female
31610.1706560.0812370.1842770.1130120.2390960.1260841.3782565.4316630.9507500.6585580.1615060.1706560.1984750.1600000.2539680.4140620.0078120.7343750.7265620.336918female
31620.1460230.0925250.1834340.0417470.2243370.1825901.3849815.1189270.9489990.6598250.2154820.1460230.1956400.0395060.2758620.5338540.0078122.9921882.9843750.258924female
31630.1318840.0847340.1537070.0492850.2011440.1518591.7621296.6303830.9629340.7631820.2008360.1318840.1827900.0837700.2622950.8328990.0078124.2109384.2031250.161929female
31640.1162210.0892210.0767580.0427180.2049110.1621930.6937302.5039540.9607160.7095700.0136830.1162210.1889800.0344090.2758620.9098560.0390623.6796883.6406250.277897female
31650.1420560.0957980.1837310.0334240.2243600.1909361.8765026.6045090.9468540.6541960.0080060.1420560.2099180.0395060.2758620.4942710.0078122.9375002.9296880.194759female
31660.1436590.0906280.1849760.0435080.2199430.1764351.5910655.3882980.9504360.6754700.2122020.1436590.1723750.0344830.2500000.7913600.0078123.5937503.5859380.311002female
31670.1655090.0928840.1830440.0700720.2508270.1807561.7050295.7691150.9388290.6015290.2677020.1655090.1856070.0622570.2711860.2270220.0078120.5546880.5468750.350000female